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Abstract 

We give an 0{n\og^ n) approximation scheme for 
Steiner forest in planar graphs, improving on the pre- 
vious approximation scheme for this problem, which 
runs in 0{n^^'^^) time. 



1 Introduction 

In the Steiner forest problem, we are given an undi- 
rected graph G with edge-lengths and a set V of pairs 
{si,ti) of vertices. The pairs are called demands, and 
the vertices that appear in demands are called ter- 
minals. The goal is to find a minimum-length forest 
F that, for every demand (si,^^), contains a path in 
F from Si to ti . This problem generalizes the Steiner 
tree problem in networks. 

There is a polynomial-time 2-approximation algo- 
rithm \V , but the problem doesn't have an approx- 
imation scheme unless P=NP [31 [TB]. However, for 
restricted input classes, polynomial-time approxima- 
tion schemes have been found. For the case where 
the vertices are the points on the plane and edge- 
lengths are Euclidean distances, Borradaile, Klein, 
and Mathieu "5^ give an approximation scheme that 
can be implemented in 0{n log n) time where n is the 
number of terminals. 

For planar graphs, Bateni, Hajiaghayi, and 
Marx [2^ give a polynomial-time approximation 
scheme. The running time, however, for obtaining a 
(1 -|- e)-approximate solution has the form n"^ . The 
degree of the polynomial grows as e gets smaller. An 
efficient polynomial-time approximation scheme is an 
approximation scheme whose running time has the 
form 0{f{e)n'^) for some function / and some con- 
stant c independent of e. Thus the approximation 
scheme of Bateni, Hajiaghayi, and Marx is not an 
efficient PTAS in this sense. Our main result is an 
eflacient PTAS: 
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Theorem 1.1. For planar Steiner forest, there is 
an approximation scheme whose running time is 
0(nlog^ n). 







Figure 1: Left: a carving. Right: a branch decomposition. 



2 Techniques 

2.1 Branchwidth 

Tree-decomposition and branch-decomposition are 
ways to map pieces of tlie graph to nodes of a tree 
so that, loosely speaking, pieces have small overlap. 
We formally define branch-decomposition. The gen- 
eral paradigm is to reduce the problem to graphs of 
bounded tree- or branch-width. 

A carving of a ground set is a maximal family C of 
mutually noncrossing subsets of the ground set. In 
this paper, we refer to the sets in C as clusters. The 
Hasse diagram of the inclusion partial order on the 
clusters is a tree in which each node has zero or two 
children. 

A branch- decomposition of a graph G is a carving 
C of the edges of G. The boundary dC of a clus- 
ter C is the set of vertices v such that C contains 
a proper nonempty subset of the edges incident to 
V. The width of a branch-decomposition C of G is 
max{|9G| : C G C}. The branchwidth of G is the 
minimum width over all branch-decompositions of G. 

Treewidth (not defined here) is within a constant 
factor of branchwidth. Graphs of bounded treewidth 
and branchwidth are tree-like, and many problems 
can be solved exactly in linear time on such graphs. 
As we will see, this is not the case for Steiner forest. 

2.2 Framework 

The approximation scheme of Bateni, Hajiaghayi, 
and Marx fits into the framework of Klein [T3] , which 
consists of the following steps: 

spanner Find a subgraph Gi (called the Steiner tree 
spanner) of the input graph Go such that, for 
constants c and d, 



1. length(Gi) < cOPT(Go)), where OPT(Go) 
is the total length of the edges used by an 
optimum solution, and 

2. OPT(Gi) < (l-hde)OPT(Go). 

thinning Partition the edges into p subsets such 
that the contraction of any subset yields a graph 
of branchwidth 0{p). Let G2 be the graph ob- 
tained from Gi by contracting the subset S hav- 
ing the smallest total length. 

dynamic programming Find an (approximately) 
optimal solution in G2. 

lifting Lift the solution in G2 to a solution in Go by 
uncontracting edges of S and adding them to the 
solution as needed. 

This presentation of the framework differs from the 
original in [13 in that, in the original, the dynamic- 
programming step finds an optimal solution. 

Suppose the solution in the dynamic-programming 
step has length at most (1 -I- c'e)0PT(G2). Since 
contraction preserves connectivity, 0PT(G2) < 
OPT(Gi). By the spanner property, OPT(Gi) < 
(1 -I- de) OPT(Go). The addition of some edges from 
S in the lifting step increases the length by at most 
length(Gi)/p. We choose p = c/e so the additional 
length is at most e OPT(Go). Hence the length of the 
final solution is ((1 + c'e)(l + de) + e) OPT(Go). 

The dynamic-programming step is straightforward 
and takes linear time; the construction is given 
in [13 . (See also [S]. (It has been general- 
ized to bounded-genus graphs [5] and, more recently, 
to minor-excluded graphs [5].) The lifting step is 
problem-dependent but straightforward for the prob- 
lems (such as TSP, Steiner tree, and Steiner forest) to 
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which the framework has been successfully applied. 
The spanner and dynamic-programming steps are 
problem-dependent. It is in those steps that Bateni, 
Hajiaghayi, and Marx |2] introduced new techniques, 
and it is there that our improvements go. 

2.3 Spanner 

The spanner step of Bateni et al. built on the same 
step in the Steiner tree PTAS of Borradaile, Klein, 
and Mathieu [H [6] . The proof of the latter can be 
adapted to show the following. For an instance of 
Steiner forest consisting of a graph G and a set T) of 
terminal pairs, let OPT(G,I?) denote the optimum 
value. 

Lemma 2.1 (Borradaile et al., adapted). For a num- 
ber e > 0, a planar graph Gq, and a tree T of Gq, 
there is a subgraph H of length /(e) length(T) such 
that, for any set V of pairs of terminals belonging to 
T, OPT{H,V) < OPT(G'o,r') elength(T), where 
/(•) is a fixed function. Furthermore, H can be con- 
structed in O(nlogn) time for fixed e. 

To use this result, Bateni et al. introduced an al- 
gorithm called prize- collecting (PC) clustering. 

Theorem 2.1 (Bateni et al.). There is a polynomial- 
time algorithm that, given a number e > and a ( not 
necessarily planar) Steiner-forest instance (G,I?), 
outputs a partition ViU- ■ -WDi ofD and correspond- 
ing trees Ti, . . . ,Ti such that 

1. the terminals comprising Vi belong to the tree 

2- Eti length(r,) < (f + 2) OPT(G,P), and 

3- ELiOPT(G,I?,) < (l + e)0PT(G,2?). 

To obtain a spanner for the Steiner forest instance 
{Gin, 'Din), therefore, one can perform PC clustering, 
and then, for each tree Ti, apply Lemma [2.1| to obtain 
a spanner Hi. The third property of PC clustering 
implies that the union [ji^i Hi will be a Steiner-forest 
spanner for the original instance. 

Gassner [TD] showed that Steiner forest is NP-hard 
even in graphs of treewidth 3. Bateni et al. ad- 
dressed this difficulty by giving an (inefficient) PTAS 
for Steiner forest in bounded-treewidth graphs, one 
that takes n<^("''/') time (w=width). 



2.4 Our improvements to the spanner step 

When we try to obtain a quasi-linear approximation 
scheme, the PC-clustering algorithm of Bateni et al. 
fails us in two ways. 

• The running time is 0{n^ log n). Indeed, the run- 
ning time is given in [5] as "polynomial" ; beyond 
that, it does not matter since the overall time for 
their approximation scheme is 0{nf^'^'^). 

• Once the trees Ti, . . . , T^. are found, a spanner Hi 
needs to be found for each tree. Finding a span- 
ner, given Ti, takes O(nlogn) time, so the over- 
all time for finding the spanners is 0{kn\ogn). 
Since k is ^l{n) in the worst case, the bound is 
0{n^ logn). 

We give a PC-clustering theorem that addresses both 
issues: the algorithm runs in O(nlogn) for planar 
graphs (in fact, for any excluded-minor family) and 
it returns subgraphs Gi , . . . , G^ with small overlap 
(each edge is in O(logn) subgraphs) in which the 
spanners can be foundj^ 

Theorem 2.2 (New PC-clustering). For any 5 > Q, 

there is an algorithm that, given e > and a Steiner 
forest instance {G,T)), outputs a partition T>iU- ■ -WDi 
ofD and corresponding trees Ti, . . . ,Te and subgraphs 
Gi 
1. 



2. 

3. 



. . ,Gi such that 

the terminals comprising T>i belong to the tree 
Ti 

Eti length(r,) < (^ + 2) OPT(G,I?), 

Eti OPT(G„ 2?.) < (1 + e) OPT(G, V), 
4. each edge of G is in O(logn) of the subgraphs. 
If the input graph G is simple and planar or, more 
generally, comes from a fixed excluded-minor family, 
the running time of the algorithm is 0(n log n). 

Combining this algorithm with the 0{nlogn) con- 
struction of Lemma 
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we obtain an 0{n\og n) 
algorithm for obtaining a Steiner-forest spanner for 
simple planar graphs. 

PC-clustering is but one example of the use of 
primal-dual approximation algorithms in approxima- 
tion schemes for planar, bounded-genus, and minor- 
excluded graphs. Our technique for speeding up 



^PC-clustering can be stated in a somewhat more general 
way and is used in this way in multiterminal cut; our result 
actually addresses the more general problem. 
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PC-clustering in planar and bounded-genus graphs 
in such graphs applies to other primal-dual approx- 
imation algorithms as well. For example, the tech- 
nique can be used on the Goemans- Williamson ap- 
proximation algorithm for prize-collecting TSP and 
prize-collecting Steiner tree. As a consequence, we 
obtain 0(n log n) approximation schemes for these 
problems in planar and bounded-genus graphs. The 
speed-up in the algorithm comes from use of a dy- 
namic data-structure [71 114j for maintaining orienta- 
tions, together with ideas from a data structure [H] 
for efficient implementation of primal-dual approxi- 
mation algorithms. 

2.5 Our improvements to the dynamic pro- 
gramming step 

When we try to obtain an efficient approximation 
scheme, the dynamic program of Bateni et al. fails us 
in one way: each tree that crosses the boundary of a 
cluster is approximately represented by 0{w/e) of its 
vertices, and there are rP'^'^/'^^ possible such vertex 
choices. 

We take advantage of the spanner property in 
combination with the bounded branchwidth prop- 
erty. Recall that the graph Gi has length at most 
cOPT(Go), and hence so does the graph G2 result- 
ing from the thinning step. In Section |4j we prove 
the following: 

Theorem 2.3. For any constant e > Q, there is 
an 0{f{w)nlog'^ n) algorithm that, for any instance 
(G, V) of Steiner forest of branchwidth w, finds a so- 
lution of length at most OPT(G,I?) -|- elength(G), 
where /(•) is a fixed function^ 

We achieve this using a new graph construction 
on branch-decompositions. For each cluster, if the 
sum of lengths of edges near the cluster's boundary 
is high then the edges are contracted. The result is a 
graph in which, for each cluster, the sum of lengths of 
edges near the cluster's boundary is not too big. We 
can therefore cover the region near the boundary by 
a constant number of regions of low diameter. This 
simplifies the dynamic program since it doesn't have 



to keep track of exactly where the terminals are — 
just which regions contain them. Since the number 
of regions is constant, we can get by with fewer con- 
figurations. 

The situation is a bit more complicated because 
the dynamic program has to deal with regions at dif- 
ferent scales, and has to guess the scales. We show 
it suffices to guess among a number of scales that is 
logarithmic in the height of the branch-decomposition 
tree and exhibit a linear-time algorithm that, given 
an arbitrary branch decomposition of width w, finds 
a new branch decomposition of width 2'w and log- 
arithmic height. Another complication is the edges 
that were contracted in the graph construction. We 
show that, after uncontracting these edges, the opti- 
mal solution can be patched so that its length does 
not increase much. Consequently, the solution found 
by the dynamic program has length not much more 
than optimal. 

Combining Theorem \2.2\ Theorem |2.3[ the 
0{n\ogn) spanner construction of [B] described in 
Lemma |2.1[ and the framework, we obtain Theo- 
rem [TTTJ 



3 Proof of Theorem 



2.2 



^There is no great significance to our changing from 
treewidth to branchwidth. 



We describe the algorithm that proves Theorem |2.2[ 
It involves just a small change to the PC-clustering 
algorithm of [5], although our presentation is differ- 
ent. 

3.1 Algorithm for basic PC-clustering with 
graph decomposition 

In contrast to [2], we describe PC-clustering using 
contractions. When an edge uv is contracted, the 
endpoints are coalesced to form a new vertex. The 
variable t represents simulated time. Part of the 
input is an assignment of "energy" to vertices, 
that, over time, consumed. When two endpoints u,v 
of an edge are coalesced, the new vertex combines 
their remaining energy. We say a vertex?; is living if 
it has not yet exhausted its energy, i.e. 4>[v] > (else 
dead), and d[v] represents the amount of (simulated) 
time V has lived so far. Our substantive change is to 
introduce the notion of "zombie" vertices [15^, which 
are vertices that are joined to living vertices not 
too long (depending on a parameter 6) after they die. 
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PC-clustering, Phase 1: 

input: an initial graph G with edge-lengths length(-), and an initial assignment <p of budgets to vertices 

t := 0: SAVE := 

for each vertex v, d[v] := 

while there is a living vertex 

Ai := min{(^[w] : v G V{G),v living} 

A2 := min {length(uz;) : uv G E{G), one of {u,v} is living} 

U {length(uu)/2 : uv G E{G), both u and v are living} 
A := min{Ai, A2} # which happens first? 
t := t + A =ff advance time 
for every living vertex u, 
d[u] := d[u] + A 
:= (j)[u] - A 

length(ut;) := length(wu) — A for every incident edge uv 
if some edge uv now has zero length, 
contract uv, creating new vertex w 
t assign (p[w] := (j)[u] + cp[v] and d[w] := max{d['u], d[v]} 

if some cndpoint (say v) is not living but t < {1 + 5)d[v] then add uv to SAVE 
Fi := {edges contracted} 
PC-clustering, Phase 2: 
initialize F2 '■= Fi 

while there is an edge e G F2 — SAVE that is the only edge incident to a dead vertex v 
delete e from F2 



Figure 2: The new PC-clustering algorithm 
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Let Go denote the graph G before the contractions of 
Phase 1. In the following, unless otherwise stated, the 
term vertices includes the original vertices of Go as 
well as the new ones formed by contraction. For each 
vertex v, let cjiolv] denote the initial value of (j)[v], the 
value when it is first assigned (whether before Phase 1 
commences, in the case of vertices of Go, or in line f 
for vertices created by contractions). 

The contractions define a binary forest, the con- 
traction forest, on the vertices. If an edge uv was 
contracted and the resulting vertex is w then u and 

V are the two children of w in the contraction forest. 
For each vertex v, let Sy be the set of vertices of 

Go that were coalesced to form v, and let Gy be the 
subgraph of Go induced by Sy. 

We say an edge e of Go is incident to a vertex v 
if exactly one of the endpoints of e in Go belongs to 

Sy . 

A vertex v is isolated if, at the end of Phase 2, no 
edge of F2 is incident to it. We define the isolated- 
dead-vertex forest X to be the forest whose nodes are 
the isolated dead vertices and such that the parent of 

V is its nearest proper isolated dead ancestor in the 
contraction forest. 

Lemma 3.1. For each vertex v e V(T), there is 
a connected component Ty of F2 whose vertex set is 

Sy — [J{Syy I W O CMld Of V 171 I}. 

Lemma 3.2. The depth of I is at most 1 + 
J E{0oM : uGVjGo)} 

min{0oM : v G ^(Go), 0oH > 0} ' 

Proof. At the end of Phase 1, for each dead vertex 
V, d[v] is the time when v died. Each root r of I 
has d[r] < J2{'t'o[u] : u G V{Gq)}. Suppose w is a 
nonroot vertex of I, and let w be the parent of v in 
I. Let uv be the edge contracted to form w. Since v 
is isolated, uv does not remain in F2, so w ^ SAVE. 
Therefore the time t at which uv is contracted must 
satisfy t > {I + 5)d[v]. Therefore d[w] > (1 + 5)d[v]. 
This proves the lemma. □ 

The output of the algorithm is the forest X and, for 
each vertex v of I, the subgraph Gy and the con- 
nected component Ty of Lemma |3.1| Lemma |3.2| im- 



plies that each vertex/edge of Go is in a logarithmic 
number of subgraphs. 

3.2 Length of forest returned by basic PC- 
clustering 

Lemma 3.3. The forest returned has length at most 
2(l + 5)E{0oH : v^V{G^)}. 

Proof. For each value of t, let Living(t) denote the 
set of vertices that are living at time i, and let 
RecentDead(t) denote the set of vertices v such that 
(time of u's death) < t < (1-1- 5) (time of v's death) 

For each vertex v, define r['i;] = E{(/)o[m] : u G 
Sy} — 4>[v]. Intuitively, t[v] is the amount of energy 
"used up" by v and its descendants in the contraction 
tree. The algorithm ensures that (f)[v] remains non- 
negative, so t[v] < X]{'^o[w] : u G Sy}. Induction 
shows d[v] < t[v]. 

In Phase 1, when an edge is added to Fi, its its 
reduced length is zero. The reduction in the length 
of edge uv can be attributed to the endpoint(s) living. 

For time t, let Gt be the graph G at time t, and 
let Ht be the edge-subgraph of Gt consisting of edges 
that are in F2 at the end of Phase 2. The total length 
of F2 is at most / Et,eLivi„g(t) (degree of v in Ht) dt. 
Ht is a forest whose leaves are clusters that are living 
or recent dead. The degree in H of dead clusters is at 
least two, so EceLiving(t) degH(G) < 2(|Living(<)| + 
|RecentDead(i)|). Therefore the total length of 
F2 is at most / 2(|Living(i)| -I- |RecentDead(t)|) dt. 
While a vertex v is living, is decreasing at unit 
rate, so t[v] is increasing at unit rate. This shows 
/|Living(t)|dt < Y.{M ■■ V e y(Go)}. By the 
definition of RecentDead(i) and d[v] < t[v] shows 
/|RecentDead(i)|)di < SJ^Mv] ■ v e V{Go)}. 
This proves the lemma. □ 

Remark The only difference between this analysis 
and that of [2] is the part dealing with RecentDead(f). 

Recall that when two vertices coalesce, the result- 
ing vertex gets the remaining energy from its end- 
points. Therefore each bit of energy possessed by a 
new vertex v comes from some original vertex u G Sy. 
Following we think of the energy originally as- 
signed to u as having the color u. If some of w's 
energy comes from original vertex u, we will say that 
v has color u. Let E' be a set of edges of Go. We 
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say a color u is exhausted by E' if every vertex v col- 
ored by u has an incident edge in E' . These concepts 
yield: 

Lemma 3.4 (Bateni et al.). Let L be the set of col- 
ors exhausted by E' . The length of E' is at least 

Lemma 3.5. An original vertex u is exhausted by 
E' if for some dead vertex v such that u € Sv, E' 
contains a path between u and some original vertex 
not in Sy. 

3.3 Using PC-clustering in Steiner forest 



Now we prove Theorem 2.2 (The proof of the run- 
ning time is in Section |3.4[ ) The input instance of 
Steiner forest is (Gi„,I?). The algorithm finds a 
2-approximation solution F* , and then obtains the 
graph G from Gin by contracting each connected 
component K of F* . Let k be the number of com- 
ponents. Let Y be the set of components of length 
< ^length(F*). For each component K not in Y, 
the algorithm assigns energy to the vertex m of G re- 
sulting from contracting K: :— 2e^^ length(_fi'). 
All other vertices of G are assigned zero energy. 

The algorithm runs Phase 1 and 2 of Section 3.1 on 
G and obtaining F2 and the isolated-dead- vertex 
forest I. For each v S V{X), the algorithm obtains a 



subgraph Gy (see Section 3.1) and (sec Lemma 3.1) 
a connected component Ty of i^2- For each, the al- 
gorithm obtains Gy from Gy and from Ty by un- 
contracting the edges of F* , and defines T>y to be the 
set of demands (s, t) eV for which s,t e V{T^). We 
claim that these structures satisfy the conditions in 
Theorem |2.2[ The first condition is satisfied by con- 
struction. Lemma 3.3 implies that the second con- 
dition is satisfied. Since each vertex of G is initi ally 
assigned energy at least j, length(F*), Lemma 



3.2 



plies that I has depth < 1 + logj^^^ k, which miplies 
the fourth condition. 

It remains to show the third condition, that 
the sum of optimum values for the subinstances 
{{G'y,T>y) : V G V{I)} is at most 1 + e times the op- 
timum value for the original instance {GimT)). Since 
I is a rooted forest, it induces a partial order on these 
subinstances. 



Let E' be the edge-set of an optimal solution to the 
original instance. Each connected component C of 
E' is assigned to the subinstance containing G that 
is farthest from a root in I, i.e. the instance for 
which Gy is smallest. Let Hy be the subgraph of G'^ 
consisting of the components assigned to {G'y.Vy). 

Since Hy might not constitute a feasible solution 
for that instance, we might need to augment them. 
Suppose that there is a demand (s, t) e Vy such that 
s and t are not connected by Hy . The 2-approximate 
solution F* contained some connected component 
K that joined s and i; let uk be the vertex in 
Go that resulted from contracting that connected 
component. Since E' is a feasible solution, it too 
contained a connected component that joined s and 
t] since that component was not assigned to the 
subinstance {G'y,T>y), it must be that the component 
is not contained in G'y, so, by Lemma 3.5 uk is 
exhausted by E' . To augment iJ^, we add the 
component K of the 2-approximate solution. In 
the augmented solution, s and t are joined. Ei- 
ther K belongs to Y or — 2e^^ \ex\gih{K). 
The sum of the lengths of components in Y is 
< k ■ 41ength(F*) < f length(£;'), and Y.{^[^k] : 
Uk is exhausted by E'} < length(iJ'), so 
^{length(if) : K ^ Y,uk is exhausted by E'} < 
elength(i?'). Therefore the sum of lengths of solu- 
tions to the subinstances is < (1 -I- e) OPT(Gi„, I?). 
This proves the third property of Theorem |2.2[ 



3.4 Primal-dual on planar and minor- 
excluded graphs 

We show that some primal-dual approximation al- 
gorithms, including Goemans and Williamson's ap- 
proximation algorithm for Steiner forest, and Bateni, 
Hajiaghayi, and Marx's algorithm for PC clustering 
(and our modification of this algorithm), can be im- 
plemented in 0(n log n) time for planar graphs. 

The method is to combine an approach of [T^] to 
implementing primal-dual approximation algorithms 
with a technique of [7. 

3.4.1 Interface to data structure 

Klein [H] shows that primal-dual algorithms such as 
that of [11] can be implemented using a data struc- 
ture. There are two categories {active and inactive 
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in the case of primal-dual). An ordered pair (c, c') 
of categories is called a bicategory. Each vertex v is 
assigned a to category c{v) , and thus each edge uv is 
assigned to a bicategory. The data structure supports 
the following operations: 

• DecreaseCost(6, S), where 6 is a bicategory and 
(5 is a real number, decreases by 6 the cost of all 
edges in bicategory b. (2) 

• FindMin(6) returns the minimum-cost edge in bi- 
category b. 

• ChangeCategory(w, c) changes the category of 
u to c (implying changes to the bicategories of 
edges incident to v). 

• CONTRACTEDGE(e, c) contracts e and assigns the 
resulting vertex to category c. 

3.4.2 Representation 

Now we describe the data structure. We use the 
ideas of [12] but make some changes to allow the data 
structure to be made more efficient for graphs from 
a minor-excluded family. 

The data structure maintains the following: 

• an orientation of the edges; 

• an array C[-], indexed by vertices, such that C[v] 
is the category of v; 

• an array OUT[-], indexed by a vertex u, such that 
OUT[v] is a linked list of the outgoing edges of v; 

• an array OUT[-, •], indexed by a vertex v and a 
category c, such that OUT[v, c] is a linked list of 
the outgoing edges of v whose tails are in cate- 
gory c; 

• an array IN[-, •], indexed by a vertex v and a cat- 
egory c, such that IN[v, c] is a pointer to a merge- 
able heap consisting of the incoming edges uv of 
V for which the tail u has category c; 

• an array B[-], indexed by bicategories, such that 
B[{ci,C2)] is a heap consisting of {IN{v,C2) ■ 
C[v] = ci}. 

Each edge and each heap has a real-number label. 
The data structure maintains the label invariant: the 
cost of an edge is the sum of its label, the label of the 
heap IN[v, b] that contains it, and the label of the 
heap B[b] that contains its heap. The key of an edge 



in the priority queue that contains it is the edge's 
label. The key of a queue IN[v,b] in the priority 
queue B[b] that contains it is the label of IN[v,b] 
plus the minimum key in IN[v, b]. 

3.4.3 Implementing DecreaseCost and FiND- 

MlN 

DecreaseCost(6, (5) is implemented by decreasing 
the label of IN{b) by S. FindMin(6) is implemented 
by finding the minimum heap in B{b), and returning 
the minimum edge in that heap. 

3.4.4 Implementing ChangeCategory 

(f , c) Now we describe how to implement the oper- 
ation CHANGECATEGORY('y, c). Let Cq :— C[v] (the 
old category of v). To handle the incoming edges of 
V, for each category c', the heap IN[v,c'] is moved 
from S[(c', Co)] to B[{c' , c)] (and the label of IN[v, c'] 
is adjusted to preserve the label invariant). To han- 
dle the outgoing edges of v, each edge vu in OUT[v] 
is moved from IN[u, cq] to IN[u, c] (and the label of 
vu is adjusted to preserve the label invariant). Time 
required is 0(|outgoing edges] logn). 

3.4.5 Implementing ContractEdge 

To implement ContractEdge(to, c), first delete 
the edge uv, and change the categories of u and v to c. 
Let w denote the vertex to be formed by coalescing u 
and v. For each category c', merge the heaps IN[u, c'] 
and IN[v, c'], and assign the result to IN[w, c']; sim- 
ilarly, merge the lists OUT[u, c'] and OUT[v, c'] and 
assign the result to OUT[w, c']. Remove the edge uv 
from the heap and list containing it. Update the ta- 
bles to reflect the fact that u and v no longer exist. 
The time required is 0( (| {outgoing edges of m}| -|- 
I {outgoing edges of i;}|)logn). 

3.4.6 Maintaining bounded outdegree 

The time per operation is O(logn) if the outdegree 
of each vertex is bounded. Brodal and Fagerberg [7] 
give a method for dynamically maintaining bounded- 
outdegree orientations in families of graphs that guar- 
antee the existence of such orientations. Kowalik [T3] 
points out that their method works with contrac- 
tions. Each update takes amortized O(logn) time 
and changes the orientation of O(logn) edges. 
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4 Proof of Theorem 2.3 



Now we prove Theorem 2.3 We are given an instance 
{Gin, 'Din) of Steiner forest, and a branch decomposi- 
tion of Gin of width w. For simphcity of presentation, 
we want to assume that each edge has length 1. To 
justify this assumption, let 77 = elength(Gi„)/(cm), 
and define a new length assignment length(e) :— 
[length(e)/?7j . Now all the lengths are integers, and 
the sum of lengths is at most ce~^m. Replace edge 
e with length(e) edgelets (if length(e) = then con- 
tract e) to achieve the assumption. Given a solution 
for the modified instance, the additional length due 
to rounding is at most 77m, which by definition of 77 
is less than e OPT(G.m, I?i„). 

4.1 Reducing the height of the branch de- 
composition 

Lemma 4.1. LetC be a branch decomposition rooted 
at C of width at most w. The output C — 
Balance(C) is a branch decomposition rooted at C 
0/ width at most 2w, and, for all edges e G C, there 
exist at most 3 log2 m + 1 clusters D G C such that 
e€ D. 

There exist linear-time implementations of Bal- 
ance and Complete. 

Proof. For all clusters D £ C , either D G C, ot there 
exist clusters Di,D2 G C with Di D D2 such that 
D = Di — D2. In the latter case, since dD C dDi U 
dD2 and \dDi\, \dD2\ <w,it follows that \dD\<2w. 

In order to prove the bound on the number of clus- 
ters containing a particular edge, we define a binary 
function S = S{mi, . . . ,mk). If mi = 1 and, for all 
i G {2, . . . , A:}, it holds that mi + • • • + mi^i > mi, 
then 5 = 0. Otherwise, ^ = 1. The use of a heavy 
path in Balance ensures that every root invocation 
of Complete has 6 = 0. 

The bound follows directly from this claim, which 
we prove by induction on k: for all inputs Ci, . . . ,Cfc 
to Complete, for all i G {1, . . . , k}, there exist at 
most 3(log2 771— log2 mi)+26 clusters D G C' such that 
D ^ Ci. The base case fc = 1 is trivial, since C = Ci. 
When fc > 1, let m^j = mi + • • • + mj^i and m^j = 
mj+i -|- • • • -I- 77ifc and (5<j ~ 6{mi, . . . , mj-i). By the 
choice of j, it holds that 7Ti<j < 777/2 and m^j < m/2. 
For alH G + 1, . . . , fc}, it follows by the inductive 



hypothesis that there exist at most 1 + 3(log2 m^j — 
\og2 mi) + 2(5>j < 3(log2 777 — log2 777i) clustcrs D G 
C such that D D C,. For aU i G {1, . . . , j - 1}, 
there exist at most 2-|-3(log2 m^j — \og2 mi) + 2S^j < 
3(log2 777 — log2 mi) + 26 clusters D £ C such that 
D ^ Ci, since 2i5<j < 26. Lastly, there exist at most 
2 < 3(log2 777 — log2 777j) + 26 clustcrs D € C such 
that D ^ Cj, since if (5 = then mj < m/2. □ 



4.2 A framework for Steiner forest in graphs 
of bounded branchwidth 

In this section, we make explicit the dynamic pro- 
gramming framework of Bateni et al., adapted to 
branch-decomposition instead of tree-decomposition. 

Definition 4.1. With respect to a cluster C , a ver- 
tex u G V{C) is active if it either belongs to dC 
or participates in a demand {u, u} G 2? such that 
V ^ V{C) - dC (see Figure^. We use active(C) to 
denote the active vertices. A demand {u, v} € T> is 
active if either u or v is active. 

Definition 4.2. Given two partitions Pi and P2, let 

Pi V P2 denote the finest partition coarser than both 
Pi and P2 . 

Definition 4.3. With respect to a cluster C, a con- 
figuration (vr'", 7r°"', tt''") is a triple consisting of 
partitions 7r'",7r°"* of dC and a partition tt'^'' of 
active(C) such that 



\dC 



TT \/ IT 



For a subgraph F , in the canonical configuration of 
C , tt'" is the connectivity of dC in F nC , 7r°"* is the 
connectivity of dC in F — C, and tt'^" is the connec- 
tivity 0/ active (C) in F (see Figure^. A subgraph 
F and a configuration (tt™, 7r°"*, tt'''') are compatible 
if tt'" is the connectivity of dC in F O C and tt^'' is 
the connectivity of active(C) in {F Ci C) V 7r°^* (see 
Figure Note that compatibility is determined by 
the edges in F H C only. 

Proposition 4.1. With respect to each cluster C, a 
subgraph F is compatible with its canonical configu- 
ration. 
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Balance(C): 

Input: a branch decomposition C with root cluster C. 

Output: a balanced branch decomposition C with root cluster C. 

Let Hi C H2 ■ ■ ■ Hk = C he a heavy path, that is, 

a maximal ascending chain such that, for alH G {1, . . . , fc — 1}, y \^ 

the sibling -ffj+i — Hi of Hi satisfies — Hi\ < \Hi\. (J' 

Let = ifi. For i e {2, . . . , k], let C'i=Hi- Hi_i. / \ 

For i e {1, . . . , /c}, let = Balance({£) :DeC, DC Q}). H2 C3 

Let C = CoMPLETE(C;, . . . ,C;). / \ ^ 

Complete(Ci, . . . ,Cfe): 

Input: branch decompositions C'i with root clusters C^. 

Clusters C( , . . . , arc pairwisc disjoint. 
Output: a branch decomposition C D C[U ■ ■ ■ U C'f, 

with root cluster C[LI---LlC'^. 
For i e let rrii = |C^|. 

Let m — mi + • • • + ruk- 

Find (via binary search) the least index j G {1, . . . , A;} 

such that mi H + 'mj > m/2. 

lij > 1: 

Let A = COMPLETE(Ci, . . . ,C^_i). 

LetB = AU{C'iU---U C'j} U C'j. 

Else: 

Let B = 

If j < k: 

Let V = Complete(C^_^i, • • • ,C^). 
Let C = B U {C} U V, where C = C( U • • • U C^. 

Else: 

Let C = S , , 

C( U • • • U C 




Figure 3: Algorithm for balancing a branch decomposition. 
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C Sk 




Figure 4-'- Active vertices. Consider cluster C and demand pairs {si^U}, {sj,tj} and {sk,tk}- Here, Si,Sj,tj and u 
are active vertices of C but U, Sk,tk are not. 




Figure 5: Canonical configuration. Consider the subgraph F depicted above, and assume that a, b, c, d, e, f, g are 
the active vertices of cluster C. Then the canonical configuration of C for F is: 7r'° = {a}, {6, c}, {d}, {e}, 7r°"* = 
{a, 6}, {c, d}, {e}, and tt"^" = {a, h, c, d, g, h}, {e, /}. 




Figure 6: Compatibility of a subgraph with a configuration. If the set of active vertices is {a, b,c,d,e, f,g,h}, then the 
subgraph F depicted above is compatible with the configuration tt™ = {a}, {&, c}, {d}, {e}, 7r°"* = {o, 6}, {c, d}, {e}, 
and tt''" = {a, 6, c, d, g, /i}, {e, /} (left side). It is also compatible with the configuration a'" = tt'", ct"'"* = 
{a},{b},{c,d,e}, and c"" = {a,h},{b,c,d,e, f,g} (right side). 
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Definition 4.4. Let Cq be a cluster with child 
clusters Ci and C2. For i G {0,1,2}, let 
(tt™, 7r°"*, 7rf") he a configuration with respect to Ci. 
The configurations (Trj,", tt™*, Trf}"), (Tri"^, tt?"^*, Trf ) 
and (tt™, TTj"*, 7r|'') are compatible if all of the fol- 
lowing conditions hold. 

• TTg" = (tt™ V 7r2")|aCo' internal connectivity of 
the parent is the join of the internal connectivity 
of the children. 

• For i e {1,2}, tt holds that tt™' = (vrg"' V 
'!^3!-i)\dCi- the external connectivity of a child is 
the join of the external connectivity of the parent 
and the internal connectivity of the other child. 

• Forie {0, 1, 2}, it holds that irf = {irf V nf V 

''■2")lactivc(C.)- 

See Figure [7] for an example. 

Lemma 4.2. Let F be a subgraph. Then, for every 
cluster Co with child clusters Ci and C2, the canoni- 
cal configurations of F with respect to Cq, to Ci and 
to C2 are compatible. 

Conversely, suppose that we have a configuration 
for each cluster, such that for every cluster Cq with 
child clusters Ci and C2, the configurations for Cq, 
for Ci and for C2 are compatible. Then there ex- 
ists a subgraph F such that the configurations are the 
canonical configurations of F with respect to the clus- 
ters. 

The configurations essentially give a local represen- 
tation of F. 

We now show how to use the representation of F 
with configurations to determine, with local condi- 
tions, whether is a feasible Steiner forest solution. 

Definition 4.5. With respect to a cluster C, a con- 
nected component is outgoing if it intersects dC. A 
configuration is outgoing if every part of tt^'^ inter- 
sects dC. 

Proposition 4.2. If F is a Steiner forest solution, 
then, with respect to a cluster C , for each active ver- 
tex u, the tree of F containing u is outgoing. 

Definition 4.6. Let Cq be a cluster with child clus- 
ters Ci and C2 . Three configurations for Co , Ci , C2 



are demand-consistent if they are compatible, outgo- 
ing, and if the following condition holds in addition: 
For all demands {s,t} active for Ci and C2 but not 
Cq, terminals s and t are related by tTq" V ttJ'" V 7r|". 

For example, in the example of Figure [Tj the con- 
figurations are demand-consistent because terminal 
g is related to terminal g': in nf^^, g is connected to 
d, and in 7r|'', d is connected to g' . For an example 
where ttq'^ comes into play, see Figure [sj 

The following lemma establishes that a subgraph 
represented by its canonical configurations is a feasi- 
ble solution if and only if the configurations are con- 
sistent with one another. 

Lemma 4.3. A subgraph F is a Steiner tree solu- 
tion if and only if, for every cluster Cq with children 
Ci,C2, the canonical configurations of F with respect 
to Cq,Ci and C2 are demand- consistent. 

Now, suppose that for each cluster C, we restrict 
attention to a subset He of the configurations of C. 
We call those configurations simple. Then the above 
setup leads to a dynamic program to find the shortest 
Steiner forest F such that for every cluster C, the 
canonical configuration of C for F belongs to He . The 
dynamic program works as follows. 



The runtime of that dynamic program is 
0(n(maxc |nc 1)'^), times the cost of checking 
that three configurations are demand-consistent. 

Bateni et al. proposed a definition of He such that 
maxc and proved that, under their 

definition, there exists a near-optimal forest such that 
for every C, the canonical configuration of C be- 
longs to He. Here, building on the spanner prop- 
erty, we propose a different definition of He, such 



For each cluster Co in bottom-up order, 
if Co is a single edge e, 
then the cost of a configuration is either 1 or 
depending on whether e needs to be in. 
else, let Ci and C2 denote the two children of Co; 
for each configuration ttq G Hc^ , 
cost(7ro) := min(cost(7ri) -|- cost(7r2)), 
where the min is over tti g IIcj^ and 7:2 G 
such that 7ro,7ri,7r2 are demand-consistent. 
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Figure 7: Compatible configurations. In the above example, assume that g is active for Ci and g' is active for C2, 
but neither g nor g' are active for Co. The dashed connections represent 7r°"*, the solid connections determine Tr;", 
and the union of solid and dashed edges determine 7rf". 



c 

Figure 8: s and t are related by ttq" V Trf" V Trf": s is connected to w via Trf", m is connected to v via ttq", and v is 
connected to t via Trf". 
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that maxc |nc| = 0((loglogn)^('^^). This gives us 
the improvement from an inefficient to an efficient 
approximation scheme. 

In a nutsheh, here is the idea: if the optimal forest 
has several trees that come close to the same ver- 
tex V of dC, then we connect them with paths to 
transform them into a single tree; that simplifies the 
configuration. We charge the length of that path to 
the total length of edges in the neighborhood of v in 
dC. To make sure that we do not charge the same 
edges several times over, we contract edges that get 
charged. 

4.3 Contractions in a branch decomposition 

For a graph G and a set S of vertices, SP{G, S) 
denotes thcj^ shortest-path forest rooted at S. For 
a number fc, define SP{G, S, k) = {e e SP{G, S) : 
e is in a path of length at most k starting at S}. 
That is, SP{G, S, rf) is the shortest-path forest for 
vertices of G whose distance from dG is at most 77. 

Let G be a graph with branch decomposition C. 
Fix a parameter a whose value, a function of e and 
of w, will be set later. We define a recursive algorithm 
that operates on the clusters of C. For each cluster 
C in bottom-up order, it computes a radius p*^, such 
that paths of length p*^ are much shorter (by a factor 
of a) than the number of edges within distance p*^ of 
dC. It then contracts all edges within distance p'~^ of 
dC. 



def ContracTq(C): 

= Uc. child of C CONTRACT„(C,) 

p'^ ■= max{p : \ength{S P {C / A'^ ,dC,p)) > ap} 
B'^ = SPiC/A^,dC, p^) U 
return B'^ 



What are the structural properties achieved by the 
contraction algorithm? First, the contracted graph 
C/B'-^ has linear growth rate: 

Lemma 4.4. For every p > 0, 
\ength{S P{C/B^,p)) < ap. 

^For uniqueness, assume the edges of G are assigned distinct 
ID numbers, and define SP{G, S) to be the shortest-path forest 
in which ties are broken by ID number. 



Proof. Observe the following simple property of 
contractions: SP{G, S,rii -I- 772) is the disjoint union 
of S'P(G, 5, 771) and SPiG/SP{G,S,r]i),S,r]2). 
Thus, length{S P{C/B'^,dC,p)) 

length{S P{C/A'^,dG,p^ + p)) 

length(S'P(C/A'^,9C,pC')). By definition of p^, 
SP{G/A^,dG,p'^) has total length at least 
ap'~' . By maximality of p*^, the length of 
SP{G/A^,dG,'n^ + p) is less than 0(77^ -f p). 
The lemma follows. □ 

Second, the sum of all radii of contracted areas is 
small compared to the total length of G: 

Lemma 4.5. J2cecP'~^ — lcngth(G)/a. 

Proof. By definition, p*^ < 

lcngth(S'P(GMC,9G,pC))/a. By defini- 

tion of B^, length{S P{G/A^,dC,p^)) 
lcngth(pc^) - Ec, child of clength(SC^')- Sum- 
ming over clusters G G C gives the lemma. □ 

What is the running time of the contraction algo- 
rithm? Let us explain in more detail how to compute 
p^ efficiently. Here, for each vertex, d[u] denotes the 
distance from dG to u: 



for 7 := 1, 2, 3, . . . , 

s, := lcngth({777; G SP{G/A^,dG) : d[u] = i - 1, 
d[v] - 7}) 
p^ :— max{7 : si + ■ ■ ■ + Si > ai} 



The total time for each invocation ContracTq(G) 
is linear in the number of edges of G. By Lemma [4?H 
C is a log-height branch-decomposition, so the total 
time for calling ContracTc, on the root cluster of a 
graph G of 0{n) edges is 0(77 log ri). 

4.4 Regions covering partially contracted 
clusters 

Here is a high-level description of our method for find- 
ing regions for a cluster G. Fix a parameter /3 whose 
value, a function of e and of w, will be set later. 
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Denote by 2^ the minimum power of two 
greater than max„gactivo(C) 

dist(w,aC). 

For each i — 1,2, - ■ ■ , fi'^ , 

define a set Ci of regions of C / such that 
each region has diameter at most /32\ and 
together, the regions of Ci cover SP{C/B'^ , 2*). 



It follows from Lemma 4.4 that a greedy algorithm 
produces such a covering, of size \Ci\ = 0(a//3 + 
|9C|). However, in order to get near-linear running 
time, we need an algorithm that is slightly more so- 
phisticated than greedy. In the rest of this subsection, 
we present the details of the algorithm. 

For each tree of SP{C/ B'^ , ^'^), let w be a tree 
vertex that is on the boundary dC and let % denote 
the sequence of vertices encountered on an Euler tour 
of the tree. An i-region is a subpath of of length 
at most /32' whose first vertex is at a distance at most 
2* -I- /32* from dC in C/B^ . In our algorithm, d is 
a set of i-regions. For a vertex u of C, we use d[u\ to 
denote the dC-to-u distance in C/B'~^ . 



for i :— 1,2, . . . , , 
A :=0 

for each tree of SP{C/B^,n^), 

root the tree at some vertex v e dC 
construct an Euler tour Ty of the tree 
for i := , /i*-^ — 1, . . . , 1, 

5,,, :={j732» : j € {0, 1, 2, 3, . . .} 

and d[n[jpT]] < (l + /3)2*} 
for each x £ Sy^i, 
U L, U 

{subpath of Ty of length /32* 
truncated at \Ty\) starting at T„[a;]} 

Lemma 4.6. For each z = 1, 2, • • • , /i"^, the regions 
of Ci cover SP{C/B'^ 

Lemma 4.7. For each i — 1,2,- •• \Ci\ < 

2a{l + 2/3)13-^ + \dC\. 

Proof. If a vertex u is in a subpath of Ci, then it is 
at distance at most f32^ from the starting point of the 
subpath, and so u e SP{C/B^, (1 -f 2/3)2*). So, the 



sum of lengths of the i-regions is at most the length 
of the Euler tours of SP(C/B^, (1 + 2/3)2*), which is 
at most twice the n umb er of edges in SP{C/B^ , (1-1- 
SP{C/B^, {I + 2p)2^ 



4.4 



IS at 



2/3)2*). By Lemma 
most a{l + 2/3)2*. Each j-region constructed from T„ 
has length exactly /32*, except possibly the last one. 
The lemma follows. □ 

The time for finding the covers is 0(n log n). 
4.5 Simple configurations 

Definition 4.7. Fix a cluster C with boundary ver- 
tices dC and, for each i — 1,2,- •• , (Jp , a covering 
Ci of C / B'-^ hy i-regions. Given 

• an integer d < \dC\; 

• d powers of 2 in the range [2^ /{2^),2^ ], where 
^ is a parameter to he determined later; 

• a priority ordering over those d numbers, labelled 
(2*1 , 2*^ , . . . , 2*^*) by order of priority; and 

• for each ij, a set of ij -regions Qj C d-; 

consider the subpartition of active(C), denoted 
P{ii, . . . , id, Qi, • ■ ■ , Qd) , and defined by greedily set- 
ting the jth part to be 

i-i 

Pj = (active(C) n Uncontract{ |J Q)) - |J P^, 

QeQj 1=1 

where Uncontract takes as input vertices of C/B'^ 
and outputs the corresponding vertices of C . A 
configuration (tt'", 7r°"', tt''") is simple iff tt^'^ = 
tt'" V 7r°"' V P{ii,...,id,Qi,...,Qd) for some 
{h, ■ ■ ■ ,id,Qi, ■ ■ ■ ,Qd)- 

To understand this definition intuitively, d should 
be interpreted as the number of outgoing trees. The d 
powers of 2 should be interpreted as the approximate 
"radii" of those trees - maximum distance from dC to 
an active tree vertex. As in the algorithm of Bateni 
et al., the ordering should be interpreted as giving 
priority to trees whose minimal enclosing cluster is 
smaller. As in the algorithm of Bateni et al., the 
(uncontracted) -regions should be interpreted as a 
covering of the active vertices of the jth tree. 
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For each cluster C 

define a table indexed by (ii, . 



■ Qi, • ■ • , Qd) 



Lemma 4.8. Taking a and (3 to be constant, the 
number of simple configurations is (log2 7)-^'-'^^ for 
some function f of e. The time to check demand- 
consistency is O(logn). 

We can finally state the main structural Theorem 
that is at the core of Theorem 12.31 



Theorem 4.1. For any solution F, there exists 
a solution F' D F such that for every clus- 
ter C , the canonical configuration of F' with re- 
spect to C is simple, and whose length satisfies: 
length(F') < length(F) + 4/3(2u; - 1) (l + (3 logj m + 
l)/7) length(i^) + 2a-i(2w - 1) length(G). 



4.6 Proof of Theorem 4.1 



4.6.1 Defining F' 

F' is simply an extension obtained from F by adding 
some edges. First, given F and a cluster C, we define 
a partition of the form P(?i, . . . , i^, Qi, . . . , Qd)- 

Let d be the number of outgoing trees of F. Label 
these trees Ti , . . . , in a particular order, such that 
the following property holds: if the minimal cluster 
including E{Tj) is a proper descendant of the mini- 
mal cluster including E{Ti), then j < £. We now de- 
fine i'j = [log2max„gy(T_,)nactivo(C) dist('u,i9C)] and 
ij = max{z^, fi — [log2 7] — 1} and let Qj C £j be 
a minimal set of regions such that Uqgq Q covers 
the vertices of {Tj n C)/B^ . This defines the radii 
and sets of regions, hence also specifies the associated 
partition (Pi, . . . , Pd) of active(C). 

The construction of F' is in two steps. First we go 
from F to Fi by adding some edges, then we go from 
Fi to F' by adding more edges. 

First step: starting from Fi := F, modify Fi by 
processing clusters C € C in top-down order. Con- 
sider a cluster C. While there exists a j and an active 
vertex u that is in Pj but is not connected to Tj / 
in Fi/B'-' , add to Fi a path of C/B'^ connecting u to 
Tj/B'-' . (This first step is similar to the construction 
in Batcni el al.) 



Second step: starting from F' :— Fi, modify F' by 
processing clusters C G C in top-down order. Con- 
sider a cluster C. While there exists a pair of bound- 
ary vertices u,v £ dC such that u and v are not 
connected in {F' n C)/A^ but can be connected by 
adding at most 2p'^ edges of C/A'~^ to F' , add those 
edges to F' . 

The result of this processing defines F' . 

4.6.2 The canonical configuration of F' is 
simple 

Lemma 4.9. After the first step of the construction, 
for every cluster C we have: all active vertices of Pj 
are connected to Tj in [Fi H C)/B'~^ . 

Lemma 4.10. After the second step of the construc- 
tion, for every cluster C we have: if two vertices of 
dC are connected in {F' H C)/B^ then they are con- 
nected in F' f] C . 

Proof. It suffices to show that, after those paths 
are added, if two vertices of dC are connected in 
{F'r]C)/B^ then they are connected in (F' DC) / A^' . 
Suppose not, and let p be a dC-to-dC path in 
(F' n C)/B^ that is not a path in (F' n C)/A'^, 
chosen so as to have a minimal number of edges. 
Then, in C/A'~'', p starts at some vertex Pstart of 
SP{C/A'^\u, p'^') and ends at some vertex pond of 



SPiC/A^, 



Concatenating p with a path from 



u to Pstart at one end, and with a path from Pcnd 
to V at the other end, gives a path from u to u in 
C/A'-^ . The total number of edges thus added is at 
most 2p'^, so it would have been added during the 
processing. □ 

Lemma 4.11. For every cluster C , the canonical 
configuration of F' is simple. 

Proof. It suffices to show for all j that all active ver- 
tices in Pj are connected by F' to Tj. Let u' be an 
active vertex in Pj. Either in F u' belongs to Tj, in 
which case there is nothing to show, or u' belongs 
to some tree Ti of F. Thus u' is covered by some 
region of Qi. and by some region of Qj^,. The fact 
that u' g Pj indicates, by definition of Pj, that j 
must be less than £. Then, by the first step of the 



construction (Lemma 4.9), u' get connected to Tj in 
(Fi n C)/B'^ . Since Ti and Tj are both outgoing, in 
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F u' is connected to some vertex u G dC and Tj is 
connected to some vertex v S dC. By transitivity u 
is connected to v in {Fi Ci C) / B'^' . By the second step 
of the construction (Lemma 4.10), u is connected to 
u in i^' n C. By transitivity again, u' is connected to 

in F'nc. a 

4.6.3 Length of F' 

First we analyze the length increase when gong from 
to Fi. Since u e Pj, there exists a region Q £ Qj 
such that u £ Q. By definition, Q covers at least one 
vertex w € Tj. Thus the path added to connect u 
to Tj in (F' n C)/B^ has length at most 2/32'^. If 
= i'j, then the length is at most 4/31ength(Tj n C). 
In this case, we charge the length of this path to Tj. 
Otherwise, it's at most 4/31ength(F n C)/^. In this 
case, we charge the length of this path to F n C. 

We claim that each tree Tj of F is charged at most 
2w — 1 times by paths added when ij = i'j. Indeed, 



Theorem |2.3[ Lemma [4.8| implies that the DP takes 
time n log n(log log ti)'^^^^ , which is O(nlog^n). By 
Lemma |4.1[ the quality of the output satisfies Theo- 



whenever Tj is charged, some other tree Ti of F is 
connected to Tj in {F' n C)/B^ . After processing 
descendant clusters of C and adding connections be- 
tween boundary vertices, Tj is connected to Ti in 
F' n C. Since j < i, the minimal cluster Cj strictly 
enclosing Tj is either the same as for Ti or one of its 
descendant clusters. Either way, Ti must contain a 
vertex of dCj , so there are only at most 2w — 1 such 
trees Tf, so Tj is charged at most 2w— 1 times. Sum- 
ming over trees Tj, the total charge of those paths is 
at most 4/3(2^ - 1) length(F). 

The length charged to F n C is at most 4/3(2k7 — 
1) (3 log2 m+l)/j, since each edge is charged at most 
{2w — l)(31og2 m + 1) times. 

Second, we claim in the second step, going from Fi 
to F', each cluster C is charged at most 2w—l times. 
Indeed, each charge corresponds to two boundary 
vertices of C being connected by a path, and after 
2w — 1 paths are added, all of dC must be connected. 
Summing over clusters C, the total charge of those 
paths is at most X^cec 2(2w— l)p'^, which is at most 
2(211} - 1) length(G')/a by Lemma [45 



This completes the proof of Theorem |4.1 
4.7 Proof of Theorem [2?3 



Let 7 = (SlogjTO -f 1), /3 = e/8{2w ~ 1), and 
a — 2e^^c(2w — 1) where c is the constant specified in 
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